AITopics | reparameterizing mirror descent

Reparameterizing Mirror Descent as Gradient Descent

Neural Information Processing SystemsFeb-8-2026, 14:55:41 GMT

Forthis, wefirstconsiderthe -trickon(18), inwhichwesetw(t)= w+(t) w (t) where log w+(t)= rwL(w(t)), log w (t)=+ rwL(w(t)).

artificial intelligence, machine learning, warmuth, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.41)

Add feedback

Reparameterizing Mirror Descent as Gradient Descent

Neural Information Processing SystemsDec-24-2025, 02:28:47 GMT

Most of the recent successful applications of neural networks have been based on training with gradient descent updates. However, for some small networks, other mirror descent updates learn provably more efficiently when the target is sparse. We present a general framework for casting a mirror descent update as a gradient descent update on a different set of parameters. In some cases, the mirror descent reparameterization can be described as training a modified network with standard backpropagation. The reparameterization framework is versatile and covers a wide range of mirror descent updates, even cases where the domain is constrained. Our construction for the reparameterization argument is done for the continuous versions of the updates. Finding general criteria for the discrete versions to closely track their continuous counterparts remains an interesting open problem.

descent, descent update, reparameterizing mirror descent, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.62)

Add feedback

Reparameterizing Mirror Descent as Gradient Descent

Neural Information Processing SystemsMay-29-2025, 12:06:45 GMT

Most of the recent successful applications of neural networks have been based on training with gradient descent updates. However, for some small networks, other mirror descent updates learn provably more efficiently when the target is sparse. We present a general framework for casting a mirror descent update as a gradient descent update on a different set of parameters. In some cases, the mirror descent reparameterization can be described as training a modified network with standard backpropagation. The reparameterization framework is versatile and covers a wide range of mirror descent updates, even cases where the domain is constrained.

artificial intelligence, descent, machine learning, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Review for NeurIPS paper: Reparameterizing Mirror Descent as Gradient Descent

Neural Information Processing SystemsJan-24-2025, 22:00:56 GMT

Additional Feedback: Suggestions: Lack definition (anything that is not'common knowledge' should be defined and explained before using. Should not let readers guess.) 1. In eq(1), w and L is used without defined. Could first introduce the problem and mention L is loss or the target function, and w is the model parameter. 'coincides with' is not a commonly used, mathematically rigorous and clear expression.

descent, gradient descent, reparameterizing mirror descent, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.40)

Add feedback

Review for NeurIPS paper: Reparameterizing Mirror Descent as Gradient Descent

Neural Information Processing SystemsJan-24-2025, 22:00:49 GMT

The theoretical result of the paper is significant in my opinion. I also agree with the authors that the topic of this paper is at the core of machine learning and thus the paper should be evaluated based on its contributions. The reviewers also adjusted their reviews based on this point. However, I should also mention that the reviewers raised the concern that some of the definitions are omitted and few parts of the paper is not rigorous enough. Therefore, I suggest that the authors take care of such ambiguities in the final version.

descent, gradient descent, reparameterizing mirror descent, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.40)

Add feedback

Reparameterizing Mirror Descent as Gradient Descent

Neural Information Processing SystemsOct-10-2024, 08:24:34 GMT

Most of the recent successful applications of neural networks have been based on training with gradient descent updates. However, for some small networks, other mirror descent updates learn provably more efficiently when the target is sparse. We present a general framework for casting a mirror descent update as a gradient descent update on a different set of parameters. In some cases, the mirror descent reparameterization can be described as training a modified network with standard backpropagation. The reparameterization framework is versatile and covers a wide range of mirror descent updates, even cases where the domain is constrained.

descent, descent update, reparameterizing mirror descent, (2 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Filters

Collaborating Authors

reparameterizing mirror descent

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Reparameterizing Mirror Descent as Gradient Descent

Reparameterizing Mirror Descent as Gradient Descent

Reparameterizing Mirror Descent as Gradient Descent

Review for NeurIPS paper: Reparameterizing Mirror Descent as Gradient Descent

Review for NeurIPS paper: Reparameterizing Mirror Descent as Gradient Descent

Reparameterizing Mirror Descent as Gradient Descent